Rank in Wordlist | Frequency | Word |
---|---|---|
78866 | 2 | 50,000 |
78982 | 2 | 70,000 |
112321 | 1 | 10,000 |
112403 | 1 | 105,181 |
115041 | 1 | 5,400,000 |
115542 | 1 | 68,000 |
116046 | 1 | 9,000 |
116158 | 1 | 97,073 |
Rank in Wordlist | Frequency | Word |
---|---|---|
34662 | 8 | ވައިރަސް(Hepatitis |
51017 | 5 | ސޯދިގް(ސޯބެ |
52337 | 4 | ހާރޕީޒް(Genital |
53511 | 4 | ރަސޫލާ(ޞައްލަﷲ |
57420 | 4 | މަރުކަޒު(Respiratory |
63637 | 3 | ހޮޓެލް(ގައި |
69125 | 3 | އިޖިޕްޓައި(Aedes |
70187 | 3 | އެޗްޑީސީ(ން |
70220 | 3 | އޭސީސީ(ން |
72485 | 3 | ފަހުބައި(Ileum |
Rank in Wordlist | Frequency | Word |
---|---|---|
23865 | 13 | ރުފިޔާ)ގެ |
31783 | 9 | އެމްއޭސީއެލް)ގެ |
35957 | 8 | ޕީޖީ)ގެ |
37025 | 7 | ބީއެމްއެލް)ގެ |
42485 | 6 | އެޗްޑީސީ)އާ |
42494 | 6 | އޭސީސީ)ގެ |
44818 | 6 | ޖޭއެސްސީ)ން |
44819 | 6 | ޖޭއެސްސީ)ގެ |
45182 | 5 | ހ)ގެ |
48304 | 5 | އެމްބީސީ)ގެ |
Rank in Wordlist | Frequency | Word |
---|---|---|
36156 | 7 | %10 |
36157 | 7 | %50 |
45015 | 5 | %90 |
51787 | 4 | %25 |
51788 | 4 | %30 |
61851 | 3 | %20 |
61852 | 3 | %3 |
61853 | 3 | %40 |
61854 | 3 | %5 |
61970 | 3 | 20% |
Rank in Wordlist | Frequency | Word |
---|---|---|
51784 | 5 | ޤޫޠު''ންގެ |
52202 | 4 | ހަވީރަ'ށް |
52203 | 4 | ހަވީރު'ގައި |
56839 | 4 | އޮ'ނީލް |
62245 | 3 | alzheimer's |
79863 | 2 | ހަވީރު'ގެ |
83925 | 2 | ނޮޓިފިކޭޝަން'ގެ |
88339 | 2 | ކިތާބު''، |
90006 | 2 | ކްރޯމަސޯމް''(Chromosome)ގައި |
90833 | 2 | އައްދައުލާ''ގެ |
Rank in Wordlist | Frequency | Word |
---|---|---|
1464 | 350 | ފޮޓޯ/ |
13360 | 29 | ފޮޓޯ/އޭއެފްޕީ |
23649 | 13 | 9/11 |
29882 | 10 | ފޮޓޯ/އިބްރާހިމް |
30537 | 9 | 18/30 |
30629 | 9 | ހަވީރުފޮޓޯ/މުހައްމަދު |
31785 | 9 | އެމްޑީޕީ/ޖޭޕީ |
33125 | 8 | 11/ |
35955 | 8 | ޕީޕީއެމް/އެމްޑީއޭ |
35956 | 8 | ޕީޕީއެމް/ޖޭޕީ |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots